Rank in Wordlist | Frequency | Word |
---|---|---|
2933 | 7 | 3,000 |
4695 | 4 | 10,000 |
5977 | 3 | 334,0001 |
5983 | 3 | 50,000 |
5986 | 3 | 6,3001 |
7995 | 2 | 1,000 |
7997 | 2 | 100,000 |
8026 | 2 | 2,000 |
8044 | 2 | 30,000 |
8051 | 2 | 4,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
11709 | 2 | pampadulas(lubricant |
13002 | 1 | ABR(Audio |
13759 | 1 | Conditions(T&C |
14311 | 1 | GATT(WTO |
14476 | 1 | HYM(Halang |
15128 | 1 | Kababaihan(Women`s |
16773 | 1 | PYM(Parish |
17008 | 1 | Park(SIP |
17342 | 1 | Rafael(ama |
17593 | 1 | Sekswal(Sexual |
Rank in Wordlist | Frequency | Word |
---|---|---|
12946 | 1 | 78:6-7)
|
13314 | 1 | BLB)-Resistant |
13826 | 1 | DA)matapos |
22521 | 1 | larawan)ay |
Rank in Wordlist | Frequency | Word |
---|---|---|
12589 | 1 | 12.6%kung |
Rank in Wordlist | Frequency | Word |
---|---|---|
13759 | 1 | Conditions(T&C |
17327 | 1 | R&B |
17328 | 1 | R&D |
24279 | 1 | milk&coffee |
Rank in Wordlist | Frequency | Word |
---|---|---|
18169 | 1 | US$14 |
Rank in Wordlist | Frequency | Word |
---|---|---|
9635 | 2 | ay"Pipit |
11425 | 2 | niya:"Ang |
18045 | 1 | Tinikling"-sayaw |
18706 | 1 | anuman,"Ama |
19596 | 1 | daigdig-"Angry |
20271 | 1 | gawa"(munkar |
20669 | 1 | hits-"Sabado |
21527 | 1 | is…"Battle |
21528 | 1 | is…"Only |
22465 | 1 | lady"sa |
Rank in Wordlist | Frequency | Word |
---|---|---|
276 | 68 | iba't |
1121 | 19 | iba't-ibang |
1268 | 17 | ito'y |
1863 | 12 | n'yo |
2321 | 9 | Bagama't |
2340 | 9 | Ito'y |
2395 | 9 | Xi'an |
2905 | 8 | siya'y |
3068 | 7 | ako'y |
3265 | 7 | pa'y |
Rank in Wordlist | Frequency | Word |
---|---|---|
14674 | 1 | I*sa |
Rank in Wordlist | Frequency | Word |
---|---|---|
3585 | 6 | diaphram/cap |
7666 | 3 | pasabing/skylab |
9245 | 2 | Sales/Support |
9620 | 2 | at/o |
12562 | 1 | 1/2/3 |
12563 | 1 | 1/3 |
12564 | 1 | 1/4 |
12730 | 1 | 2/3 |
12895 | 1 | 6/2-5 |
12896 | 1 | 6/Paliwanag |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots